144 research outputs found

    Instruction-set customization for multi-tasking embedded systems

    Get PDF
    Ph.DDOCTOR OF PHILOSOPH

    Optimizing the MapReduce Framework on Intel Xeon Phi Coprocessor

    Full text link
    With the ease-of-programming, flexibility and yet efficiency, MapReduce has become one of the most popular frameworks for building big-data applications. MapReduce was originally designed for distributed-computing, and has been extended to various architectures, e,g, multi-core CPUs, GPUs and FPGAs. In this work, we focus on optimizing the MapReduce framework on Xeon Phi, which is the latest product released by Intel based on the Many Integrated Core Architecture. To the best of our knowledge, this is the first work to optimize the MapReduce framework on the Xeon Phi. In our work, we utilize advanced features of the Xeon Phi to achieve high performance. In order to take advantage of the SIMD vector processing units, we propose a vectorization friendly technique for the map phase to assist the auto-vectorization as well as develop SIMD hash computation algorithms. Furthermore, we utilize MIMD hyper-threading to pipeline the map and reduce to improve the resource utilization. We also eliminate multiple local arrays but use low cost atomic operations on the global array for some applications, which can improve the thread scalability and data locality due to the coherent L2 caches. Finally, for a given application, our framework can either automatically detect suitable techniques to apply or provide guideline for users at compilation time. We conduct comprehensive experiments to benchmark the Xeon Phi and compare our optimized MapReduce framework with a state-of-the-art multi-core based MapReduce framework (Phoenix++). By evaluating six real-world applications, the experimental results show that our optimized framework is 1.2X to 38X faster than Phoenix++ for various applications on the Xeon Phi

    Probabilistic Multilevel Clustering via Composite Transportation Distance

    Full text link
    We propose a novel probabilistic approach to multilevel clustering problems based on composite transportation distance, which is a variant of transportation distance where the underlying metric is Kullback-Leibler divergence. Our method involves solving a joint optimization problem over spaces of probability measures to simultaneously discover grouping structures within groups and among groups. By exploiting the connection of our method to the problem of finding composite transportation barycenters, we develop fast and efficient optimization algorithms even for potentially large-scale multilevel datasets. Finally, we present experimental results with both synthetic and real data to demonstrate the efficiency and scalability of the proposed approach.Comment: 25 pages, 3 figure

    Neural Sinkhorn Topic Model

    Full text link
    In this paper, we present a new topic modelling approach via the theory of optimal transport (OT). Specifically, we present a document with two distributions: a distribution over the words (doc-word distribution) and a distribution over the topics (doc-topic distribution). For one document, the doc-word distribution is the observed, sparse, low-level representation of the content, while the doc-topic distribution is the latent, dense, high-level one of the same content. Learning a topic model can then be viewed as a process of minimising the transportation of the semantic information from one distribution to the other. This new viewpoint leads to a novel OT-based topic modelling framework, which enjoys appealing simplicity, effectiveness, and efficiency. Extensive experiments show that our framework significantly outperforms several state-of-the-art models in terms of both topic quality and document representations

    Topic Modelling Meets Deep Neural Networks: A Survey

    Full text link
    Topic modelling has been a successful technique for text analysis for almost twenty years. When topic modelling met deep neural networks, there emerged a new and increasingly popular research area, neural topic models, with over a hundred models developed and a wide range of applications in neural language understanding such as text generation, summarisation and language models. There is a need to summarise research developments and discuss open problems and future directions. In this paper, we provide a focused yet comprehensive overview of neural topic models for interested researchers in the AI community, so as to facilitate them to navigate and innovate in this fast-growing research area. To the best of our knowledge, ours is the first review focusing on this specific topic.Comment: A review on Neural Topic Model

    Visual and biochemical evidence of glycocalyx disruption in human dengue infection, and association with plasma leakage severity

    Get PDF
    Background: Dengue is the most common arboviral infection globally; a minority of patients develop shock due to profound plasma leak through a disrupted endothelial barrier. Understanding of the pathophysiology underlying plasma leak is incomplete, but emerging evidence indicates a key role for degradation of the endothelial glycocalyx. Methods: We conducted an observational study in Vietnam to evaluate the sublingual microcirculation using sidestream darkfield imaging in (1) outpatients with confirmed dengue (2) patients hospitalized with dengue and (3) outpatients with other febrile illness (OFI). We estimated the glycocalyx degradation by measuring the perfused boundary region (PBR hf) and an overall microvascular health score (MVHS) with the software application GlycoCheckTM at enrolment, 48 h later and hospital discharge/defervescence. We measured plasma syndecan1 and endocan at the same time-points. We compared PBR hf, MVHS, syndecan1 and endocan, between (1) outpatients with confirmed dengue vs. OFI and (2) patients with dengue subdivided by clinical severity of plasma leak. Results: We included 75 patients with dengue (41 outpatients, 15 inpatients, 19 in intensive care) and 12 outpatients with OFI. Images from 45 patients were analyzed using GlycoCheckTM. There was no significant difference in PBR hf or MVHS between outpatients with dengue and OFI. Median plasma syndecan1 was not significantly different in outpatients with dengue vs. OFI, while median plasma endocan was significantly lower among patients with dengue vs. OFI during the critical phase. In patients with dengue, PBR hf was higher in patients with Grade 2 vs. Grade 0 plasma leakage during the critical phase (PBR hf 1.96 vs. 1.36 μm for Grade 2 vs. Grade 0 plasma leakage on days 4–6, respectively, p < 0.001). Median levels of plasma syndecan1 and endocan were higher in Grade 2 vs. Grade 0 plasma leakage, especially during the critical phase (Syndecan1 2,613.8 vs. 125.9 ng/ml for Grade 2 vs. Grade 0 plasma leakage on days 4–6, respectively, p < 0.001, and endocan 3.21 vs. 0.16 ng/ml for Grade 2 vs. Grade 0 plasma leakage on days 4–6, respectively). Conclusions: We present the first human in vivo evidence of glycocalyx disruption in dengue, with worse visual glycocalyx damage and higher plasma degradation products associated with more severe plasma leak
    corecore